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The formation of inclusion bodies (IBs) constitute a frequent event during the production 
of heterologous proteins in bacterial hosts. Although the mechanisms leading to their 
formation are not completely understood, empirical data have been exploited trying to 
predict the aggregation propensity of specific proteins while a great number of strategies 
have been developed to avoid the generation of IBs. However, in many cases, the 
formation of such aggregates can be considered an advantage for basic research as 
for protein production. In this review, we focus on this positive side of IBs formation 
in bacteria. We present a compilation on recent advances on the understanding of IBs 
formation and their utilization as a model to understand protein aggregation and to explore 
strategies to control this process. We include recent information about their composition 
and structure, their use as an attractive approach to produce low cost proteins and other 
promising applications in Biomedicine. 

Keywords: protein aggregation, bacterial inclusion bodies, protein folding, recombinant protein expression, 
conformational disease model, drug delivery systems, nanoparticules 



INTRODUCTION 

The deciphering of the genetic code and the availability of the 
first tools and basic procedures in genetic engineering opened 
the way for protein production in bacteria. This apparently easy 
goal encountered an unexpected difficulty: the accumulation of 
the protein of interest as insoluble form and the generation 
of inclusion bodies (IBs). Since then the generation of IBs has 
been traditionally considered an obstacle to avoid and difficult 
to predict. Notwithstanding in the last decade, IBs have been 
observed from a different perspective and their study gained 
considerable interest. From this point of view, the formation 
of IBs in bacteria is seen as part of a general cellular response 
related to the presence in the cell of unfolded proteins and 
as a pathway for the control of aggregation. From this per- 
spective, IBs constitute a valuable model to better understand 
protein aggregation in eukaryotes, and for the search of specific 
inhibitors or disaggregation approaches, in relation to relevant 
conformational diseases. On the other hand, the production of 
proteins as aggregates in IBs opened new interesting perspec- 
tives for diverse applications in Biomedicine. They can be an 
almost pure source of recombinant protein, and because of their 
particular structural and functional characteristics they can be 
potentially exploited as naturally immobilized enzymes or as 
nanomaterials. In this minireview, we focus on this positive 
side of IBs formation in bacteria. We include recent informa- 
tion about their composition, formation and structure and their 
use as an attractive approach to produce proteins at low cost. 
We also review the role of IBs as a model to understand pro- 
tein aggregation and to explore strategies to control this process. 
Finally we describe some promising applications of bacterial 
IBs as systems for controlled drug delivery and nanotechnology 
applications. 



PROTEIN AGGREGATION AS A CONSERVED CELLULAR 
RESPONSE 

Diverse conditions can lead to an alteration of protein home- 
ostasis and to protein aggregation in all living cells. Among 
others, the presence of unfolded proteins is increased after a heat 
shock or other environmental stress conditions. Some mutations 
can lead to the synthesis of unstable protein structures, as well. 
Another current event is provided by fast and high expression 
of recombinant proteins in bacteria, leading to the formation of 
IBs. Recent evidence indicates that in the cell, the quality control 
system — regulating chaperones and proteases levels — and protein 
aggregation and disaggregation are part of a cellular response to 
altered protein homeostasis. In a recent review Tyedmers et al. 
(2010) described protein aggregation as a regulated process in 
bacteria, yeast and mammalian cells. Interestingly, despite differ- 
ences, similarities of protein aggregation in these cells reflect the 
universality of the response. In particular, the subcellular localiza- 
tion of aggregates is not random but well-defined in each cellular 
type. The IBs in bacteria are mainly localized in the cell poles, and 
also in septation sites, whereas in yeast protein deposits are close 
to vacuoles or to the nucleus. In mammals deposits in aggresomes 
are associated to the nucleolus. Another relevant observation is 
the similarity at the level of protein structure between bacterial 
IBs and the amyloid aggregates characteristic of several human 
diseases, the so-called "conformational diseases", in which pro- 
tein deposits are observed in specific tissues or cells. This is the 
case for several neurodegenerative diseases such as Alzheimer or 
Parkinson, among others. 

Under a heat shock stress, about 150-200 different proteins 
were identified as aggregation-prone in Escherichia coli (Winkler 
et al., 2010). The formation of aggregates can be reversed by the 
combined effect of chaperones and protease activities and the 
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disaggregation machinery. As mentioned, aggregates are mainly 
located at cellular poles by mechanisms not yet completely under- 
stood. Recently Winkler et al. proposed that nucleoid occlusion is 
the main driving force which determines the number and posi- 
tioning of the protein aggregates in E. coli. Also, authors argued 
against the idea that an active targeting mechanism was involved 
in polar localization. Interestingly, polar localization allows an 
asymmetric partitioning of protein aggregates between daughter 
cells. This asymmetry allows an increased cell division rate in the 
population devoid of aggregates, beneficial for the ageing of the 
bacterial cell population (Winkler et al, 2010). 

PREDICTION OF A PROTEIN'S AGGREGATION PROPENSITY 

The aggregation behavior of a protein is strongly determined 
by intrinsic properties of its amino acidic sequence. This obser- 
vation supported the development of computational meth- 
ods to predict protein aggregation propensity. Interestingly, 
recent algorithms take into account not only the primary 
sequence of the polypeptide, but also experimental proteome 
data, including information about cellular localization, cytoso- 
lic, periplasmic and membrane proteins. The analysis per- 
formed by De Groot and Ventura (2010) using AGGRESCAN 
-an algorithm developed by their group, (Conchillo-Sole et al., 
2007; De Groot et al., 2012)-, indicates that the aggre- 
gation propensity of bacterial proteins is associated with 
their length, conformation, location, function, and abundance. 
Recently, in AMYLPRED2 11 predictive methods were consid- 
ered, trying to produce a consensus prediction of amyloido- 
genic determinants/"aggregation-prone" peptides in proteins, 
from sequence alone (Tsolis et al., 2013) (http://biophysics. 
biol.uoa.gr/AMYLPRED2). It is worth mentioning that other 
factors affect the aggregation of recombinant protein expres- 
sion in bacteria such as temperature and growth rate, fusion 
to soluble protein tags, specific codon usage, tRNA availabil- 
ity, and general optimization of codons in the heterologous 
expressed sequence (Cortazzo et al., 2002; Rosano and Ceccarelli, 
2009). 

IBs STRUCTURE: A SPONGE-LIKE SUPRAMOLECULAR 
ORGANIZATION 

The formation and structure of IBs in E. coli expressing heterol- 
ogous proteins has been extensively analyzed. Several excellent 
reviews summarize the current knowledge on IBs structure and 
formation (De Groot et al, 2008, 2009; Wang, 2009; Sabate et al, 
2010; Garcia-Fruitos et al., 2011, 2012). Here we present a concise 
summary and extend a little more on the most recent published 
data. 

IBs are normally observed in the cytoplasm of the producing 
bacteria as dense, large and apparently spherical or cylindrical 
particle, ranging from 0.2 to 1.2 u,m, composed of 80-95% of the 
heterologous expressed protein. IBs may also contain other pro- 
teins, like small heat shock proteins (IbpA and IbpA) and chap- 
erones (like the DnaK system), phospholipids from membranes 
and nucleic acids and other background proteins that co-purify 
with aggregates (Jurgen et al, 2010). Interestingly, the cellular 
composition of IBs evolves during cell growth so that cellular pro- 
teins are predominant during the first steps of formation, while 



heterologous proteins become predominant at the end. These 
changes occur in concert with the evolution of other parameters 
inherent to cell growth, such as division time and growth rate, 
leading to the idea that aging of bacterial population could be 
related to protein aggregation (Lindner et al., 2008). 

Different approaches have revealed that IBs bare a character- 
istic cross-P structure, resembling that found in amyloid fibers 
associated to a wide variety of human degenerative diseases. 
Notwithstanding, IBs may also contain variable amounts of 
natively folded proteins or partially folded proteins that can 
acquire their native conformation even if they are embedded in 
an aggregate (Gonzalez-Montalban et al., 2008). In fact aggre- 
gates have been found to be composed of a wide spectrum of 
conformations, ranging from native conformation to misfolded 
aggregates (Schrodel et al., 2005; Rinas et al., 2007). Moreover, 
aggregation and disaggregation have been shown to occur simul- 
taneously in vivo in actively producing recombinant bacteria 
(Carrio and Villaverde, 2002). The solubilized proteins can then 
reach their native state or alternatively suffer partial proteolytic 
degradation (Corchero et al., 1997; Carrio et al., 1999; Cubarsi 
et al., 2001; Lethanh et al, 2005; Vera et al., 2005; Rinas et al, 
2007). The proportion of functional protein is characteristic of 
the target protein's sequence (Upadhyay et al., 2012), but also 
depends on cell growth temperature (De Groot and Ventura, 
2006; Peternel et al, 2008) and on the genetic traits of the host 
strain (Garcia-Fruitos et al., 2009) 

The cross-f5 sheet regions have been shown to be refractory 
to proteinase K (PK) digestion, while native or native-like struc- 
tures are highly sensitive to PK digestion. Using a GFP reporter 
model, Cano-Garrido et al. (2013) have shown that when IBs are 
submitted to mild digestion with the protease their morphology 
or size is not affected while fluorescence emission and density 
are notably diminished. They propose that IBs present a sponge- 
like structure, where the PK resistant fibrils constitute a scaffold 
which confers mechanical stability to IBs, while the functional, 
PK sensitive fraction accumulate in the gaps of this scaffold. 

In accordance to this, Walther et al. (2014) suggested a mecha- 
nism of pore diffusion out of a barrier layer for the solubilization 
of IBs. They propose that the solubilization process involves dif- 
ferent layers in the IBs: a core, consisting of the IBs agglomerates, 
a reactive and a diffusion layer. The densely packed inner cores 
of protein shrink as the solubilized protein diffuses to the outer 
layers and subsequently through a porous barrier layer into free 
solution. The authors propose that this model correlates well with 
the IBs structure suggested by Cano-Garrido et al. (2013), the bar- 
rier layer corresponding to the amyloid scaffold, which becomes 
visible only as solubilization progresses. 

IBs A MODEL TO STUDY PROTEIN AGGREGATION RELATED 
TO CONFORMATIONAL DISEASES 

The formation and disaggregation of bacterial IBs gained growing 
attention as models to study insoluble protein deposits observed 
in some complex human diseases, as in the so-called "confor- 
mational diseases". This approach is strongly supported by the 
concept that protein aggregation is part of a conserved cellular 
response. Three examples have been chosen to illustrate how IBs 
are employed as a model to study aggregation proteins involved 
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in particular human diseases and as a useful screening approach 
for the search for aggregation inhibitors. 

EXPANDED polyQ IN HUNTINGTON DISEASE 

Huntington disease is a neurodegenerative disorder that affects 
muscle coordination, followed by cognitive and psychiatric prob- 
lems. The disease is caused by mutations in the Huntingtin gene, 
in which expansion of the triplet CAG within the first exon of the 
gene produces a protein carrying stretches of repeated glutamines 
(polyQ). When polyQ exceeds a critical length, huntingtin pro- 
tein undergoes amyloid aggregation (Orr and Zoghbi, 2007). 
E. coli has been employed to follow in vivo the aggregation pro- 
cess of an artificial protein harboring a polyglutamine (polyQ) 
tract (Ignatova et al., 2007). E. coli growth rate was found to be 
sensitive to the protein conformational state, and showed that 
misfolded peptides and soluble aggregates were cytotoxic (Miller 
etal., 2010). 

Related to some pathologies, the relationship between "aggre- 
gation" and "toxicity" is often controversial. This question was 
recently explored in E. coli by expressing PolyQ-Containing 
Ataxin-3 (Invernizzi et al., 2012). For this purpose, the toxicity 
of three variants expressed in E. coli was determined according to 
reduction of growth rate. The authors showed that toxicity was 
correlated to the formation of soluble cytosolic oligomers, but 
not to peptide aggregation. Instead, interestingly, the aggregates 
appeared to be protective against cell toxicity (Invernizzi et al., 
2012). 

PRION PROTEIN EXPRESSION IN BACTERIA 

Prions are protein aggregates with self-perpetuating ability and 
thus infectious (reviewed in Villar-Pique and Ventura, 2012). 
Prions are involved in transmissible spongiform encephalopathies 
(TSEs), a family of rare progressive neurodegenerative disorders 
that affect both humans and animals. Bacterial IBs have been 
exploited as a tool for the study of the structural and functional 
characteristics of prions. Het-s, from the fungus Podospora anse- 
rina, was the first prion protein whose bacterial IBs were shown 
to display amyloid-like properties (Sabate et al., 2009; Wasmer 
et al., 2009). These E. coH-produced aggregates were transfected 
into prion-free fungal strains, and were shown to promote pri- 
onic conversion of Het-s at levels comparable to those induced by 
homologous amyloid fibrils (Sabate et al., 2009). A similar obser- 
vation was reported in the case of the yeast prion Sup35. The IBs 
of this protein were used to induce the [PSI+] prion in [psi-] 
prion-free yeast strains. These results highlight the fact that the 
infectivity rate can be easily modulated by tuning the environ- 
mental conditions during the formation of IBs (Radchenko et al., 
2011; Sabate etal, 2012). 

ALZHEIMER Ap42 AGGREGATION 

There is an increasing interest in developing methods to identify 
cellular factors that trigger the aggregation of proteins inside the 
organism as well as to discover drugs able to interfere with these 
factors. Villar-Pique et al. (2012) describe a fast, cost-effective 
high-throughput approach to study conditions and molecules 
that affect Ap"42 aggregation. The assay is based on the use of 
IBs formed by an Af542-GFP fusion protein in bacteria. They 



showed the ability of the approach to detect the effect of metal 
ions on Af542 aggregation as well as to identify compounds that 
block metal-induced reaction. The authors further propose that, 
as many proteins form IBs when expressed in bacteria (De Groot 
et al, 2009), this approach may have a much larger applicabil- 
ity in the search for aggregation modulators in conformational 
disorders (Villar-Pique et al., 2012). 

METHODS FOR THE RECOVERY OF FUNCTIONAL PROTEINS 
FROM IBs 

Considering IBs as a source of almost pure proteins, one possible 
way is to attain the dissolution of aggregates in order to obtain 
native-folded, active protein. The challenge is then to solubilize 
and refold as much aggregated protein as possible and obtain a 
stable, functional product. The cost of the whole process must 
be taken into consideration if the aim is to produce a large-scale 
manufactured product. 

The rate and yield of the solubilization process seem to be 
influenced by the conditions used, like chaotrope addition, con- 
centration, temperature, pressure, etc. Even if there is no general 
method to solubilize and refold a protein and the strategy in 
each case must be "custom-made", most IBs protein-recovery pro- 
cedures include the following steps (Burgess, 2009; Basu et al., 
2011): (i) overexpression of the selected protein in an appropri- 
ate host strain; (ii) isolation of IBs; (iii) solubilization of IBs; 
(iv) refolding (including disulfide bond formation when neces- 
sary); (v) high-resolution chromatography (vi) quality control of 
obtained material. 

Solubilization and refolding are the most critical steps in 
the procedure and successful conditions still depend mostly 
on trial-and-error strategies (Burgess, 2009). Notwithstanding, 
efforts have been made to rationalize the refolding step, the 
free REFOLD database (http://refold.med.monash.edu.au; Chow 
et al., 2006a,b) can be a useful tool for the design of procedures 
for the refolding and purification of recombinant proteins. 

There are very good recent reviews that compile the different 
methods employed in the different steps of the recovery process. 
In accordance to the above mentioned findings on the struc- 
ture and solubilization mechanisms of IBs, the tendency is to 
employ milder extraction conditions, avoiding strong denatura- 
tion and refolding conditions. In Table 1 we listed the most recent 
approaches reported in the last 3 years. 

IBs NANOPARTICULES AND CONTROLLED DELIVERY OF 
DRUGS 

A recent report showed that peptide hormones of the pituitary 
gland are stored intracellularly as amyloid aggregates within the 
secretion granules. The amyloid cross-fi structure provides a very 
stable and highly compacted state from which controlled release 
of functional monomeric hormone can take place upon signal- 
ing (Maji et al., 2009). A somewhat similar situation occurs with 
bacterial IBs. In fact, IBs are composed by amyloid-like aggregates 
from which substantial amounts of functional recombinant pro- 
tein can be released in vivo as well as under mild (non-denaturing) 
conditions in vitro. This feature — reminiscent of a drug delivery 
system — led to the concept of the "nanopills," i.e., nanoparti- 
cles which are able to release proteins with therapeutic effects 
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Table 1 | Novel approaches employed in the recovery of proteins from IBs. 



Recovery step 


Strategy 


Advantage 


Protein 


References 




Solubilization/Refolding 


6M n-propanol 


Chaotropic effect at high 
concentration/kosmotropic effect at 
dilution helps refolding 


r-hGH 


Singh et al., 


2012 


Solubilization/Refolding 


Aminoacid-based detergent + arginine 
assisted refolding 


Arg minimizes protein-protein 
interactions 


scFvs 


Kudou et al. 
2011 a,b 


■ 


Solubilization 


Non-reducing buffer + low concentration 
of urea 


Avoids inappropriate disulfide bonds 
and aggregates. Suitable for proteins 
that aggregate with native-like 
structures 


PrP 


Walsh et al. 


, 2012 


Solubilization 


Microwave assistance 


Shortens solubilization time 


various 


Datta etal., 


2013 


nt; i uiui My 


niyu iiyuiubLdLio (jicbbuic; 


Mn n ra\ /i fi i ic rirfi+oin Honati ration 
INU pifcJVIUUb piULfcilll UfcM Id LUI d LION 

required 


LECT2 


Zheng et al. 


, 2013 


Refolding 


Dialysis against PE-PEG 


Simple and cheap method; high yield 


FhuA A1-159 


Dworeck et al., 




(Polyethylene- Polyethyleneglycol) 






2011 




Solubilization 


Alkaline-shock 


No use of chaotropic agents 


adiponectin 


Heiker et al. 


, 2010 


Refolding 


Solid-phase refolding in cation-exchange 
resin with decreasing gradient of urea 


Minimize time and number of steps 


BMP-2 dimers 


Rane et al., 


2013 



directly from the inside of cells (Vazquez et al., 2012). As a proof 
of concept, Vazquez et al. demonstrated that the addition of dif- 
ferent types of IBs to the culture medium of mammalian cells, 
for example IBs composed of HSP70, leukemia inhibitory factor 
or catalase, were able to rescue them from cis-platinum, serum 
deprivation or oxidative stress, respectively. Also, dihydrofolate 
reductase IBs complemented the intrinsic cell deficiency of this 
enzyme. Furthermore, IBs are spontaneously internalized by cul- 
tured cells, as was directly demonstrated with green fluorescent 
protein IBs (Villaverde et al., 2012). Following a similar approach, 
Liovic et al. (2012) introduced keratin 14 IBs into epithelial 
cells which do not normally express this protein, and found that 
intracellular keratin filaments start to form. 

CONCLUDING REMARKS 

The common view of IBs as undesirable by-products in recombi- 
nant protein production have been lastly reconsidered, in view of 
the potential of these structures for different purposes. On the 
other hand, IBs provide to constitute an easy to handle model 
for the study of the molecular basis of conformational diseases. 
On the other hand, IBs provide a source of almost pure polypep- 
tides and are a potentially useful source of ready-to-use protein. 
In this sense, the aim is then to obtain IBs containing as much 
folded, functional protein as possible. The design of strategies to 
reach this aim requires a deep knowledge of IB's structure and 
formation, in order to identify possible molecular targets which 
can be "tuned" to improve the protein recovery yield or to obtain 
IBs with the desired characteristics to allow their use as enzyme 
carriers or nanomaterials. 
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